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Abstract. We introduce two models of inclusion hierarchies: Random Graph Hierarchy (RGH) and Limited 
Random Graph Hierarchy (LRGH). In both models a set of nodes at a given hierarchy level is connected 
randomly, as in the Erdos-Renyi random graph, with a fixed average degree equal to a system parameter 
c. Clusters of the resulting network are treated as nodes at the next hierarchy level and they are connected 
again at this level and so on, until the process cannot continue. In the RGH model we use all clusters, 
including those of size 1, when building the next hierarchy level, while in the LRGH model clusters of size 
1 stop participating in further steps. We find that in both models the number of nodes at a given hierarchy 
level h decreases approximately exponentially with h. The height of the hierarchy H, i.e. the number of 
all hierarchy levels, increases logarithmically with the system size N, i.e. with the number of nodes at the 
first level. The height H decreases monotonically with the connectivity parameter c in the RGH model 
and it reaches a maximum for a certain Cmax in the LRGH model. The distribution of separate cluster 
sizes in the LRGH model is a power law with an exponent about —1.25. The above results follow from 
approximate analytical calculations and have been confirmed by numerical simulations. 

PACS. 89.75.-k Complex systems 


1 Introduction 

Hierarchical structures can be found in many forms in 
many real world systems. Four general classes of hierar¬ 
chical systems can be distinguished [?] - order, control, 
inclusion and level hierarchies. An order hierarchy is a 
set of units ordered by an internal variable attributed to 
these units, e.g. a company income, a book size or a simple 
social rank [?]. Control hierarchies describe control rela¬ 
tions such as boss-subordinate [?] or leader-followers [?] 
and are usually represented by directed graphs [?,?]. In¬ 
clusion hierarchies [?,?,?] correspond to stuctures where 
a unit of a higher level includes several units of a lower 
level, e.g. a university includes faculties or an army divi¬ 
sion includes regiments. Level hierarchies can be treated 
as a special class of inclusion hierarchies when interacting 
elements of a lower level collectively form elements of a 
higher level and the higher level elements possess some 
emergent properties absent at the lower level [?]. Exam¬ 
ples are cells forming tissues and organs or macromolecules 
forming internal cell structures. The concept of inclusion 
and level hierarchies is close to the community structure 
of networks [?,?] that is defined through a network topol¬ 
ogy, and to recently studied networks of networks [?,?,?] 
The original community structure measures considered a 
simple division of nodes into disjoint communities. This 
has changed, as more recent works focus on overlapping 


communities, that are able to form an inclusion hierarchy 
[?,?]. In fact, Clauset [?] introduces a very general model 
of a hierarchical graph. His model is descriptive in nature, 
as it does not determine connection probabilities across 
the hierarchy. Hierarchical community organization has 
been also considered for several dynamical models [?,?,?]. 
In this paper, we introduce two models of inclusion hier¬ 
archies, with a specified mechanism of hierarchy organiza¬ 
tion. The models are not strictly network models but can 
be interpreted in terms of nested community structures, 
or in terms of networks of networks featuring a nested hi¬ 
erarchical topology. While the first investigated model is 
straightforward, the second one shows a nontrivial depen¬ 
dence of the total number of levels on system parameters. 
In the following sections we define the RGH and LRGH 
models and derive their basic properties, mainly the total 
number of hierarchy levels that exist in the system. 


2 Random Graph Hierarchy 

We propose a hierarchical system consisting of N nodes 
which are organized into many nested clusters. Our idea 
is to recursively repeat the connection procedure of the 
well-known Erdos-Renyi graph and treat connected clus¬ 
ters created at each level as nodes at a next level. The 
algorithm for creating such structure is following: 
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1. Link together each pair of nodes with a probability 
Po = c/{N — 1). The result is W\ clusters of the hierar¬ 
chical degree h = 1. The nodes belonging to the same 
cluster are considered as neighbours, so their mutual 
degree of neighbourhood is one. 

2. The existing clusters of hierarchical degree h = 1 are 
treated as primary-level nodes which are linked to¬ 
gether randomly with a probability pi = c/(W\ — 1) 
per pair. The result is W -2 clusters of the hierarchi¬ 
cal degree h = 2, containing sets of clusters of degree 
h = 1. If nodes i and j belong to the same cluster with 
the hierarchical degree h = 2, but to different clusters 
of the hierarchical degree h = 1, their mutual degree 
of neighbourhood is two. 

3. Repeat step 2 for next hierarchy levels h = 3,4, 5... 
until only one cluster remains. The total number of 
iterations H required is proportional to the logarithm 
of the system size H ~ In TV. 

The model possesses only two parameters: N - the to¬ 
tal number of nodes and c - the mean number of connec¬ 
tions per node created at each level. Note that making the 
parameter c constant instead of connection probability p 
known from E-R model means that despite a changing 
number of clusters Wf,. the connection density is always 
the same in relation to the percolation threshold ((k) = 1 
in random graphs) at each level hs. The resulting struc¬ 
ture will be called Random Graph Hierarchy (RGH). The 
average number of clusters Wh of hierarchical degree h de¬ 
creases as Wh = Na h , where a = a(c) = 1 — c/2. This is 
due to the average number of clusters in a random graph 
of N nodes equal to n c = N( 1 — c/2) + 0(1) (this is true 
only below the percolation threshold, i.e. for c < 1) [?]. 



Fig. 1. The building process of the Random Graph Hierarchy 
(RGH). During each step nodes/clusters are randomly con¬ 
nected (as in the Erdos-Renyi random graph) to form clusters. 
The clusters are connected in the next step in the same way 
and so on. The process continues until the whole system is 
aggregated in a single cluster. The coloured ellipses show the 
clusters (red for level 1, green for level 2 and blue for level 3). 


The total number of hierarchies H fulfills the following 
equation: 

W H = N(1 - °-) H = 1. (1) 


It follows that 


H = - 


In N 

ln(l — c/2) 


(2) 


Eq. [2] is only an approximation since it is based on the 
average number of clusters in E-R graph which is correct 
only for large N what is not fulfilled in the case of h close 
to H. 



N c 

Fig. 2. The highest level of hierarchy H as a function of the 
system N (left picture, c = 0.8, averaged over 500 realizations) 
and average degree c (right picture, N = 5000, averaged over 
500 realizations) for RGH model. The lines are analytical re¬ 
sults (Eq. [2]). Error bars are smaller than symbol sizes. 


Note that while the resulting structure is a graph at 
each hierarchy level, it does not possess well-defined links 
as a whole, since higher-level links are between clusters, 
not between individual nodes. 


3 Limited Random Graph Hierarchy 

In the RGH model each node/cluster is given many chances 
to join other elements. Even if an element of level h does 
not connect with any other, it advances to level h +1 as a 
cluster of the size one. We present an alternative approach 
in the Limited Random Graph Hierarchy (LRGH) where 
only clusters that have merged at the level h can partic¬ 
ipate in the merging at the level h + 1. Otherwise they 
drop out and do not participate in further merging. In ef¬ 
fect the procedure of the cluster growth is limited only to 
the groups which are continually developing. Aside from 
dropping out nodes/clusters that do not find partners, the 
procedure is the same as for the RGH model and has the 
same two parameters: N and c. The procedure is as fol¬ 
lows: 

1. Link together each pair of nodes with the probability 
Po = c/{N — 1). The result is W\ clusters (of size at 
least two) which advance to the second step. Unlinked 
nodes are the clusters of the size one and they do not 
participate in the further steps (and they are not in¬ 
cluded in Wi). 

2. Merge together each pair of clusters with the probabil¬ 
ity Pi = c/ {W\ — 1). The result of merging two clusters 
is a new cluster which contains these sub-clusters and 
whose size is the sum of the sizes of the merged clus¬ 
ters. The new clusters (IT 2 ) advance to the next step. 
The clusters which did not merge during this step do 
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not advance and stop growing (and they are not in¬ 
cluded in W 2 ). 

3. Repeat the step 2 for the clusters of successive levels 
until all clusters stop growing. 



Fig. 3. Multilevel growth of clusters in the LRGH model. 
The nodes/clusters at every level are randomly connected 
(as in Erdos-Renyi random graph) to form clusters. The 
nodes/clusters which did not connect and therefore remain 
alone become inactive and are exempt from further dynamics. 
The clusters formed from the successful connections remain ac¬ 
tive and are connected in the next step in the same way. The 
process continues until no active clusters remain. The coloured 
ellipses show the clusters (red for level 1, green for level 2 and 
blue for level 3). 


Similarly to RGH model, the number of clusters on 
given hierarchy level Wh and the maximum hierarchy level 
H can be calculated analytically. At the beginning we find 
the average number of clusters Wh of level h (which ad¬ 
vanced to h + 1-th step, obviously Wo = N). Consider the 
first step of the above procedure. As previously noted in 
Sect. [H the average number of clusters resulting from the 
linking together each pair of Wo nodes with the probabil¬ 
ity po = c/(Wo — 1) is no = Wo(1 —c/2), including clusters 
of the size one (single, unlinked nodes). The average num¬ 
ber of unlinked nodes Rq is given by the probability of 
Wo — 1 failures to link multiplied by the number of all 
nodes: 


Ro = W 0 



c 

Wo-1 


Wq —1 


( 3 ) 


therefore the average number of clusters Wi which ad¬ 
vanced is the difference between no and Rq: 


Wi = no — Rq = Wq 



(1 c 

1 

7 

£ 

{ Wo-1 



(4) 


leading to a recursive equation for Wh- 


W h 


W h - X 




Wh-i-1' 


( 5 ) 


Since Eq. [5] does not depend explicitly on h , we assume 
that Wh -1 is large enough to use the approximation 


(1 


L \W h - i-l 
W/_i 1 


( 6 ) 


which is independent from h and Wh- The accuracy of 
this approximation depends mainly on Wh- 1 , the error 
not exceeding 10% when Wh -1 > 7. In practice, for all 
lower hierarchy levels with their numerous clusters, it is 
a very good approximation, while introducing some error 
at the top level. Since the number of clusters at the last 
level H — 1, where connections still appear, is around 7 
or higher, the error of EqE]is at most 10%, but it causes 
upwards to 20% discrepancy in the following Eq. [3 Note 
that this error causes a systematic underestimation of Wh 
(and in consequence a corresponding underestimation of 
H). The error is smaller for smaller c and is independent 
of the system size N and the number of hierarchy levels 
H. 

Using the approximation (Eq. [Gj we obtain 

W h = N( l-|-e" c )\ (7) 


which describes the dependence of number of active clus¬ 
ters on the hierarchy level h. We can invert this equation, 
calculating the hierarchy level h of a system with Wh clus¬ 
ters. Then, using it for h = H we can write 


H(c,N ) 


In ((W h )/N) 
ln(l — |-e-)- 


( 8 ) 


Let us first assume that (Wh) = 1, meaning that the last 
hierarchy always consists of a single cluster merged in the 
previous step. If we do so, then Eq. [5] depends on the value 
of c only through the argument of the logarithm, and pos¬ 
sesses a maximum for the argument c ma x = In 2 « 0.6931. 
While this prediction reproduces the general shape of the 
relation correctly, both the values and the position of the 
maximum are underestimated. Using a different terminat¬ 
ing constant than (Wh) = 1 changes the height, but the 
position of the maximum remains the same. 

To get a better approximation, it is necessary to find a 
more precise value of (Wh)- The procedure of LRGH cre¬ 
ation stops when none of the existing clusters connects 
with any other. The probability that it happens depends 
on the number of participating clusters W and the con¬ 
stant c: 


W(W- 1) 

= ’ • < 9 > 

This probability is conditional on the number of clusters 
W present in the system. The probability that the system 
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passes through a state with a given number of clusters 
W is equal to the probability that it has not finished the 
dynamics at earlier steps. However, since the system does 
not visit each number of clusters W in order, but visits 
only certain values during a single realization, we approx¬ 
imate it by treating it as if the system visits all of them, 
but spends less than one step in each. Effectively we con¬ 
sider h as the time, since one procedure step changes it 
by 1 each time. Since the system spends less time in each 
W, the probability P c of terminating the process should 
be multiplied by the time spent at a given value of W. It 
can be approximated as the inverse of the absolute value of 
derivative AW/Ah. Using Eq.[7]for W(h), we can write the 
conditional probability of terminating the process with W 
clusters scaled by the time spent with W clusters during 
the evolution 


Pct(W,c) = 


Ah 


AW 


<io> 


Now the probability to complete the dynamics at a given 
number of clusters W is 


N 

P a (W,c)=P ct (W,c) (1 -Pct(k,c)). (11) 

k=W +1 


Unfortunately this probability is not normalized, since our 
approach does not describe the behaviour of the system 
correctly after it arrives at W = 1. We therefore treat it 
as the probability to terminate before W = 1 (it is always 
lower than 1). If the system manages to merge down to 
one cluster, then the process stops, as any further merg¬ 
ing is impossible. We therefore assume that the missing 
probability is the chance to stop at W = 1. This leads to 
the final formula for the average number of clusters at the 
hierarchy H 


N N 

(Wh) = J2 WP a(W,c) + P «W C ))' ( 12 ) 

W=2 W -2 

where the first term is average of W over probabilities 
P a (W,c ) for W > 1, while the second term is the prob¬ 
ability to stop at W = 1. Because of the high algebraic 
complexity, we have not managed to express Eq. [12] in 
a simple analytical form, and instead we have calculated 
this value numerically. We have observed that while the 
products and series presented by Eq. [l^] and Eq. ]TT] for¬ 
mally depend on TV, in practice the terms quickly become 
very small for large W and (Wh) attains a limit value 
independent of TV. We used only the first 100 terms (as 
if TV = 100) in our study, which for c > 0.1 produced an 
error no larger than 0.05 that quickly decreases with c (for 
c = 0.5 it is less than 10 -10 ). Figure [I] shows that despite 
many approximations and assumptions used, the value of 
(Wh) calculated this way is very close to what is observed 
during numerical simulations. 

Putting Eq. [12] into Eq. [H] we obtain the value H (c, TV) 
that reproduces the numerical results with a greater ac¬ 
curacy than the value H (c, TV) based on the assumption 



Fig. 4. The average number of clusters observed at the final 
level H. The crosses are numerical values averaged over 2000 
realizations (TV = 5000). The line is an analytical result calcu¬ 
lated using Eq. 1121 Error bars are smaller than symbol sizes. 


(Wh) = 1, as seen in Fig. [5] However the discrepancies 
with the numerical results are still visible. The numeri¬ 
cal maximum of H(c ) is slightly shifted towards c = 1, 
i.e. for TV = 5000 it is about 0.8, while the analytical 
value c ma x( TV = 5000) = 0.7631. These differences origi¬ 
nate from two sources: we take the average value of Wh 
(although this is a very good approximation, as seen in 
Fig. O and we use the approximation shown in Eq. [6] 
which introduces a substantial error for the last hierar¬ 
chies. The position of the maximum c max , due to (Wh) 
depending on c, now depends weakly on the system size 
TV (i.e. c max (TV = 100) = 0.8178). 
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Fig. 5. The highest level of hierarchy TV as a function of the 
system size TV (left picture, c = 0.8, averaged over 500 realiza¬ 
tions) and the connectivity c (right picture, TV = 5000, aver¬ 
aged over 2000 realizations) for LRGH model. The red lines are 
analytical results (combined Eq.[S]and Eq. 1121) . while the blue 
line (right picture) shows Eq. [8] under assumption (Wh) = 1 . 
Error bars are smaller than symbol sizes. 



Unlike in the RGH model, where the clusters merge 
until only a single one is left at the level H, in the LRGH 
model clusters stop participating in further merging at 
the moment when they fail to form any connection in a 
given step. This means that the final structure consists of 
many disconnected clusters. We investigated the distribu¬ 
tion of these clusters and observed a dependence similar 
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to a power law (Fig. [Gj. Due to the a relatively small ex¬ 
ponent (a ps —1.25) in a single realization of the system 
the power-law tail is disrupted by existence of the largest 
cluster. This behavior is seen in Fig. [Gj where single re¬ 
alizations feature one or few very large clusters that may 
dominate the whole system, and the peaks correspond¬ 
ing to these clusters are often clearly separated from the 
quasi-continuous part of the distribution. 



Fig. 6. The distribution of cluster sizes for LRGH model (c = 
0.8, N = 5000). Circles, triangles and squares are examples of 
single realizations. The black solid line shows the distribution 
averaged over 1000 realizations. The gray straight line is a 
power-law with the exponent a = —1.25, just for reference. 


larger regional powers or corporations arising from pro¬ 
gressive mergers of smaller companies. Our models gen¬ 
erate structures that possess several levels of hierarchical 
organization. While some real systems are shallow, with 
only one or two levels, in many of them several levels could 
be distinguished, such as labs, divisions, faculties and fi¬ 
nally universities or townships, counties, individual states 
and the whole of United States. Of particular interest may 
be the fact that in the LRGH model the final system can 
consist of clusters that stopped growing at various lev¬ 
els. They can represent different organizational complex¬ 
ity, similar to countries that depending on their size (but 
not only) could have different depths of organization, how¬ 
ever they are usually still considered same-level entities. 
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4 Conclusions 

We have introduced two models of hierarchical structures, 
based on random connections between the nodes (as in 
Erdos-Renyi graphs) at different hierarchy levels h, with 
a constant mean node degree independent from h. Both 
models show an approximately exponential decrease of the 
number of elements at successive levels, and possess a well 
defined maximum hierarchy level H (the height of the hi¬ 
erarchy). The height H is proportional to the logarithm 
of the number of nodes N. It decays monotonicaly with 
the connectivity parameter c in the Random Graph Hier¬ 
archy model, but possesses a maximum as a function of 
this parameter for the Limited Random Graph Hierarchy 
model. The obtained analytical results are supported by 
numerical simulations. 

The models can be used for investigation of opinion dy¬ 
namics or other collective processes [?] on hierarchical 
structures. Since many real social systems are hierarchical 
[?] the models may offer a simple representation for them. 
The proposed models could be also considered as reference 
models for investigations of inclusion hierarchies [?,?,?,?]. 
We expect they may be useful for cases where a system 
self-organizes into a hierarchy, example being countries, 
which can arise from settlements joining into progressively 





